An Automaton Model for Xcerpt Type Checking and XML Schema Validation
نویسنده
چکیده
An automaton model used for validation and type checking with languages defined using R2G2 [1] is presented. First, tree-shaped data is considered to be handled by the automaton model, then the approach is extended to graph shaped data. The presented approach is based on specialized non-deterministic finite state automata. The specialisation copes with unranked tree shaped data. Graph shaped data will be treated as, possibly inifinite in depth, trees. The choice of using non-deterministic automata is motivated by complexity issues: as the tree automata are based on regular expressions, non-deterministic automata are a necessary intermediate step. Arguably deterministic tree automata are more eficient on validating data, but the derivation of such automata from non-deterministic ones comes with potentially exponential costs. As all the needed algorithms can be achieved on non-deterministic automata in sub-exponential time and space complexity, no need for determinisation arises.
منابع مشابه
Initial Specification of a Language Extension with Types and Type Checking
Types are a useful mechanism for early error detection and optimization of any programming language. Semi-structured query evaluation as in Xcerpt needs to be able to cope without schema information or types, in contrast to the relational case. Nevertheless, if available, type information enables query optimization well beyond what is possible in its absence. Furthermore, since semi-structured ...
متن کاملRegular Rooted Graph Grammars
This thesis investigates a pragmatic approach to typing, static analysis and static optimization of Web query languages, in special the Web query language Xcerpt[43]. The approach is pragmatic in the sense, that no restriction on the types are made for decidability or efficiency reasons, instead precision is given up if necessary. Pragmatics on the dynamic side means to use types not only to en...
متن کاملIntegrating Correction into Incremental Validation
Many data on the Web are XML documents. An XML document is an unranked labelled tree. A schema for XML documents (for instance a DTD) is the specification of their internal structure: a schema is a tree grammar, and validating a document w.r.t. a schema is done by a running of a tree automaton. Given a document, valid w.r.t. a DTD, and a sequence of updates (insertions, deletions and replacemen...
متن کاملIncremental Validation of Key and Keyref Constraints
We suggest simple update operations on XML documents. These operations may change the value of a simple-type node, or change several values transactionally. We present efficient algorithms for checking the validity of such operations with respect to key and keyref constraints (XML Schema identity constraints). We discuss the implementation of the algorithms for checking the validity of single/m...
متن کاملIncremental Validation of String-Based XML Data in Databases, File Systems, and Streams
Although the native (tree-like) storage of XML data becomes more and more important there will be an enduring demand to manage XML data in its textual representation, for instance in relational structures or file systems. XML data has to be wellformed by definition and additionally, in many cases, it has to be valid according to a given XML schema. Because the XML column types are often derived...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007